Using Data and Text Mining Techniques for Yeast Gene Regulation Prediction: A Case Study
نویسندگان
چکیده
We focus on the problem of predicting yeast gene regulation experiments. In order to construct a good solution, we study combinations of different methods that are not yet to be found in any single data mining application. We describe our approach to propositionalizing the given relational data that describes the interaction among proteins. We study how we can exploit a large archive of scientific publications that contain additional information about the proteins. We present experiments in which we automatically extract additional attributes from the text archive that enhance the predictive results. The studied task is one of the two data mining problems of the KDD Cup 2002; the solution that we describe achieved the highest score in one of the two subtasks and received an “Honorable Mention” for the overall task.
منابع مشابه
Using Combined Descriptive and Predictive Methods of Data Mining for Coronary Artery Disease Prediction: a Case Study Approach
Heart disease is one of the major causes of morbidity in the world. Currently, large proportions of healthcare data are not processed properly, thus, failing to be effectively used for decision making purposes. The risk of heart disease may be predicted via investigation of heart disease risk factors coupled with data mining knowledge. This paper presents a model developed using combined descri...
متن کاملPersonal Credit Score Prediction using Data Mining Algorithms (Case Study: Bank Customers)
Knowledge and information extraction from data is an age-old concept in scientific studies. In industrial decision-making processes, the application of this concept gives rise to data-mining opportunities. Personal credit scoring is an ever-vital tool for banking systems in order to manage and minimize the inherent risks of the financial sector, thus, the design and improvement of credit scorin...
متن کاملPrediction of Acid Mine Drainage Generation Potential of A Copper Mine Tailings Using Gene Expression Programming-A Case Study
This work presents a quantitative predicting likely acid mine drainage (AMD) generation process throughout tailing particles resulting from the Sarcheshmeh copper mine in the south of Iran. Indeed, four predictive relationships for the remaining pyrite fraction, remaining chalcopyrite fraction, sulfate concentration, and pH have been suggested by applying the gene expression programming (GEP) a...
متن کاملArtificial Intelligence for prediction of porosity from Seismic Attributes: Case study in the Persian Gulf
Porosity is one of the key parameters associated with oil reservoirs. Determination of this petrophysical parameter is an essential step in reservoir characterization. Among different linear and nonlinear prediction tools such as multi-regression and polynomial curve fitting, artificial neural network has gained the attention of researchers over the past years. In the present study, two-dimensi...
متن کاملPrediction of Blasting Cost in Limestone Mines Using Gene Expression Programming Model and Artificial Neural Networks
The use of blasting cost (BC) prediction to achieve optimal fragmentation is necessary in order to control the adverse consequences of blasting such as fly rock, ground vibration, and air blast in open-pit mines. In this research work, BC is predicted through collecting 146 blasting data from six limestone mines in Iran using the artificial neural networks (ANNs), gene expression programming (G...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002